Document Expansion for Text-based Image Retrieval at WikipediaMM 2010
Abstract
We describe and analyze our participation in the WikipediaMM task at ImageCLEF 2010. Our approach is text-based image retrieval: we apply information retrieval techniques to the metadata documents of the images. We submitted two English monolingual runs and one multilingual run. In the monolingual runs, the query and the metadata documents were in the same language; in the multilingual run, queries in one language were used to search the metadata provided in three languages. The main focus of our work was using English queries to retrieve images based on the English metadata. For these experiments, the English metadata was expanded using an external resource, DBpedia. This study extends the document expansion approach from our previous participation in ImageCLEF 2009. In 2010 we combined document expansion with a document reduction technique that aimed to keep only topically important words in the metadata. Our experiments used the Okapi feedback algorithm for document expansion and the Okapi BM25 model for retrieval. Experimental results show that combining document expansion with the document reduction method gives the best overall retrieval results.
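The pipeline described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the tokenizer, the BM25 parameters (k1 = 1.2, b = 0.75), and the term-selection score (number of top-ranked external documents containing a term, multiplied by its IDF, as a simplified stand-in for the Okapi feedback weighting) are all assumptions made for illustration, and the document reduction step is not shown. Each short metadata document is used as a query against an external collection (standing in for DBpedia abstracts), and the best-scoring new terms are appended to it.

```python
import math
from collections import Counter


def tokenize(text):
    # Assumption: lowercase whitespace tokenization; a real system would
    # also apply stop-word removal and stemming.
    return text.lower().split()


class BM25Index:
    """Tiny in-memory BM25 index over a list of text documents."""

    def __init__(self, docs, k1=1.2, b=0.75):
        self.k1, self.b = k1, b
        self.docs = [Counter(tokenize(d)) for d in docs]
        self.lengths = [sum(c.values()) for c in self.docs]
        self.n = len(self.docs)
        self.avgdl = sum(self.lengths) / self.n
        # Document frequency: number of documents containing each term.
        self.df = Counter(t for c in self.docs for t in c)

    def idf(self, term):
        df = self.df.get(term, 0)
        # Non-negative IDF variant, chosen here to avoid negative weights.
        return math.log(1 + (self.n - df + 0.5) / (df + 0.5))

    def score(self, query_terms, i):
        tf, dl = self.docs[i], self.lengths[i]
        total = 0.0
        for t in set(query_terms):
            f = tf.get(t, 0)
            if f == 0:
                continue
            norm = f + self.k1 * (1 - self.b + self.b * dl / self.avgdl)
            total += self.idf(t) * f * (self.k1 + 1) / norm
        return total

    def search(self, query_terms, top_k=3):
        # Return (score, doc_id) pairs with a positive score, best first.
        scored = [(self.score(query_terms, i), i) for i in range(self.n)]
        scored = [(s, i) for s, i in scored if s > 0]
        scored.sort(key=lambda p: (-p[0], p[1]))
        return scored[:top_k]


def expand_document(metadata, external, top_docs=2, top_terms=3):
    """Append terms from the top-ranked external documents to the metadata."""
    terms = tokenize(metadata)
    hits = external.search(terms, top_k=top_docs)
    seen = set(terms)
    r = Counter()  # r[t]: number of top-ranked documents containing term t
    for _, i in hits:
        for t in external.docs[i]:
            if t not in seen:
                r[t] += 1
    # Simplified selection score (assumption): r[t] * idf(t).
    ranked = sorted(r, key=lambda t: (-r[t] * external.idf(t), t))
    if not ranked:
        return metadata
    return metadata + " " + " ".join(ranked[:top_terms])


if __name__ == "__main__":
    # Hypothetical external collection standing in for DBpedia abstracts.
    abstracts = BM25Index([
        "eiffel tower paris france landmark",
        "paris france capital city",
        "golden gate bridge san francisco",
    ])
    print(expand_document("eiffel tower", abstracts))
```

The expanded metadata documents would then be indexed with the same BM25 model, so that a query such as "paris landmark" can match an image whose original metadata mentioned only the Eiffel Tower.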
Similar resources
DCU at WikipediaMM 2009: Document Expansion from Wikipedia Abstracts
In this paper, we describe our participation in the WikipediaMM task at CLEF 2009. Our main efforts concern the expansion of the image metadata from the Wikipedia abstracts collection DBpedia. Since the metadata is short for retrieval by query words, we decided to expand the metadata using a typical query expansion method. In our experiments, we use the Rocchio algorithm for document expansion....
Document expansion for image retrieval
Successful information retrieval requires effective matching between the user’s search request and the contents of relevant documents. Often the request entered by a user may not use the same topically relevant terms as the authors of the documents. One potential approach to address problems of query-document term mismatch is document expansion to include additional topically relevant indexing ter...
Document Expansion for Text-Based Image Retrieval at CLEF 2009
In this paper, we describe and analyze our participation in the WikipediaMM task at CLEF 2009. Our main efforts concern the expansion of the image metadata from the Wikipedia abstracts collection DBpedia. In our experiments, we use the Okapi feedback algorithm for document expansion. Compared with our text retrieval baseline, our best document expansion RUN improves MAP by 17.89%. As one of our...
Télécom Bretagne at ImageCLEF WikipediaMM 2010
In this paper, I describe the approach proposed by Télécom Bretagne for the WikipediaMM 2010 evaluation campaign [6]. One of the main challenges in large scale image retrieval is the mismatch between query terms and image textual descriptions from the database. This mismatch can be reduced using query expansion and here I present a Wikipedia based query expansion approach. In order to boost res...
DEU at ImageCLEF 2009 WikipediaMM Task: Experiments with Expansion and Reranking Approaches
This paper describes the participation of Dokuz Eylül University in the WikipediaMM task at ImageCLEF 2009. This year we concentrated on two main topics. The first is the expansion of native documents, term phrase selection, and query expansion processes, which are based on WordNet, WSD and WordNet similarity functions. The second is a new reranking approach with Boolean retrieval and CM based clustering. Exp...
Publication date: 2010